1st Workshop on Intelligent and Knowledge oriented Technologies

Authors

  • Michal Laclavík
  • Ivana Budinská
  • Ladislav Hluchý
  • Zoltán Balogh
  • Marián Babík
Abstract

A Levenshtein edit operation is a basic string operation: the insertion, deletion or substitution of a character in a string. A sequence of edit operations can be used to transform a basic word form (lemma) into an inflected form, and the same sequence can be used to transform other lemmata belonging to the same inflectional paradigm. The presented system contains inflection paradigms of over 56000 lemmata from the Short Dictionary of Slovak Language and from the most frequent word forms in the Slovak National Corpus, together with detailed grammar information about each generated word form.

1 Levenshtein distance and some definitions

Levenshtein distance [1] is a metric defined on the space of strings as the minimum number of Levenshtein edit operations needed to transform one string into the other, where a Levenshtein edit operation is the insertion, deletion or substitution of a character.

A Levenshtein edit operation e can be formally described as a triple e = (o, s, d), consisting of the operation type o, the position in the source string s and the position in the destination string d, where the operation type o is one of replace, insert or delete. For replace or insert, the replacement or new character is taken from the destination string.

A sequence of edit operations q = (e1, e2, e3, ...), together with the destination string D, when applied to a string S ∈ 𝒮, defines a mapping function f : 𝒮 ↦ 𝒮, where 𝒮 is the set of all strings.

To each word form w ∈ W, where W is the set of all words, we can assign a set of grammar categories Gw = {g1, g2, g3, ...} represented by short mnemonic strings (called morphological tags). For each word form together with its morphological tag, (wi, gi) ∈ W × G, there exists a mapping function fi consisting of Levenshtein edit operations such that fi(l) = wi, where l is a deliberately chosen word, called the lemma and considered to be the basic word form of the given lexeme.
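The definitions above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `edit_ops` and `apply_ops` and the example words (ryba/ryby, voda/vody, Slovak feminine nouns chosen here for illustration) are assumptions. The sketch derives one minimal sequence of (o, s, d) triples from a lemma and its inflected form, then applies the same sequence, with the same destination string D, to another lemma.

```python
def edit_ops(src, dst):
    """Return one minimal list of (op, s, d) triples that turns src into dst."""
    n, m = len(src), len(dst)
    # dist[i][j] = Levenshtein distance between src[:i] and dst[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if src[i - 1] == dst[j - 1] else 1
            dist[i][j] = min(dist[i - 1][j] + 1,         # delete
                             dist[i][j - 1] + 1,         # insert
                             dist[i - 1][j - 1] + cost)  # keep or replace
    # Walk back from the bottom-right corner to recover the operations.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and src[i - 1] == dst[j - 1] \
                and dist[i][j] == dist[i - 1][j - 1]:
            i, j = i - 1, j - 1                  # characters match, no operation
        elif i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + 1:
            i, j = i - 1, j - 1
            ops.append(("replace", i, j))
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            i -= 1
            ops.append(("delete", i, j))
        else:
            j -= 1
            ops.append(("insert", i, j))
    ops.reverse()
    return ops


def apply_ops(ops, src, dst):
    """Apply (op, s, d) triples to src; new characters are taken from dst."""
    out = []
    pos = 0  # next position in src that has not been consumed yet
    for op, s, d in ops:
        out.append(src[pos:s])   # copy the untouched characters before s
        pos = s
        if op == "replace":
            out.append(dst[d])
            pos = s + 1
        elif op == "insert":
            out.append(dst[d])
        elif op == "delete":
            pos = s + 1
    out.append(src[pos:])        # copy the untouched tail
    return "".join(out)


# Learn the operations from one lemma and its genitive singular ...
q = edit_ops("ryba", "ryby")              # [("replace", 3, 3)]
# ... and reuse them, with the same destination string, on another lemma.
print(apply_ops(q, "voda", "ryby"))       # vody
```

Positions in this sketch are counted from the start of the string, so the sequence only transfers cleanly between lemmata of equal length. How the presented system aligns positions across lemmata of different lengths is not stated in the abstract.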


Similar resources

Classroom-Oriented Higher Education System or Workshop-Oriented Higher Education System (Based on Cost & Economic Approach)

The most important goal of each society is to reach economic development. As both the goal and the agent of development, man has an important responsibility, which is realized by way of education, especially higher education, because the universities are the main factors for progress, production of knowledge and education of specialized human forces and they play a significant role in...


The Symbiosis of Human and Semantic Technology Through the Lens of Actor-Network Theory

Background: Semantic technologies (STs) have made machine reasoning possible by providing intelligent data management methods. This capability has created new forms of interaction between humans and STs, which is called "semantic interaction." The increasing spread of this form of interaction in daily life reveals the need to identify the factors affecting it and introduce the requirements of...


Knowledge Acquisition Tools for Intelligent Tutoring Systems

The EDULAN project focuses on the field of Computer Assisted Learning and aims to generalize and translate to web-oriented technologies some results obtained previously in the Intelligent Tutoring Systems area. In particular, four complementary research lines are being considered: knowledge acquisition tools for building Intelligent Tutoring Systems; adaptive and Web-oriented teaching/learning...


Selected Topics on Information Logistics: Editorial Introduction to the Issue 2 of CSIMQ

While the amount of information relevant for enterprises and organizations grows ever more, the decisions and operational tasks depending on information are becoming more complex. Accurate and readily available information is indispensable in problem solving, decision-making, and knowledge-intensive work. Studies on information use show that information overload is perceived as a problem in org...


Proceedings of the 1st workshop on Emotion and Computing – Current Research and Future Impact

Emotion-oriented computing is a broad research area involving many disciplines. The network of excellence HUMAINE is currently making a co-ordinated effort to come to a shared understanding of the issues involved, and to propose exemplary research methods in the various areas. This overview paper presents a proposed “map” of the research area, distinguishing core technologies from application-o...


Sw-el'05: Applications of Semantic Web Technologies for E-learning in Conjunction with 12th International Conference on Artificial Intelligence in Education (aied'05) Special Session on Semantic Web for Adaptive Learning Environments Session Co-chairs: Special Session on Semantic Web-based Educational Information Systems

Preface: The AIED'05 session of the SW-EL'05 workshop focuses on Semantic Web-based knowledge representation and engineering approaches and methods for the needs of intelligent learning systems and discusses issues related to their use for content and knowledge components specification, effective intelligent courseware construction and modelling the learner. The following topics are addressed...



Journal title:

Volume   Issue 

Pages  -

Publication date 2006